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What is the optimal way to measure the galaxy power spectrum? 
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Measurements of the galaxy power spectrum contain a wealth of information about the Universe. 
Its optimal extraction is vital if we are to truly understand the micro-physical nature of dark matter 
and dark energy. In Smith & Marian (2015) we generalized the power spectrum methodology of 
Feldman et al. (1994) to take into account the key tenets of galaxy formation: galaxies form and 
reside exclusively in dark matter haloes; a given dark matter halo may host galaxies of various 
luminosities; galaxies inherit the large-scale bias associated with their host halo. In this paradigm 
we derived the optimal weighting and reconstruction scheme for maximizing the signal-to-noise on a 
given band power estimate. For a future all-sky flux-limited galaxy redshift survey of depth 6j > 22, 
we now demonstrate that the optimal weighting scheme does indeed provide improved S/N at the 
level of ~ 20% when compared to Feldman et al. (1994) and ~ 60% relative to Percival et al. (2003), 
for scales of order k ~ 0.5 hMpc“'^. Using a Fisher matrix approach, we show that the cosmological 
information yield is also increased relative to these alternate methods - especially the primordial 
power spectrum amplitude and dark energy equation of state. 


Introduction — The matter power spectrum is a funda¬ 
mental tool for constraining the cosmological parameters. 
It contains detailed information about the large-scale ge¬ 
ometrical structure of space-time, as well as the phe¬ 
nomenological properties of dark energy and dark mat¬ 
ter. Given a galaxy redshift survey two things are crucial: 
how to obtain an unbiased and optimal estimate of the 
information in the matter fluctuations. 

State-of-the-art galaxy redshift surveys, such as the 
Baryon Oscillation Spectroscopic Survey w here¬ 
after BOSS], Galaxy And Mass Assembley [H hereafter 
GAMA], and WiggleZ [3, have all used the approach of 
Feldman et al. [S] hereafter FKP] to estimate the power 
spectrum. This assumes that galaxies are a Poisson sam¬ 
pling of the underlying density field. Hence, provided 
one subtracts an appropriate shot-noise term, and decon¬ 
volves for the survey window function, one should obtain 
an unbiased estimate of the matter power spectrum. 

In the last two decades, our understanding of galaxy 
formation has made rapid progress since the work of FKP 
and our current best models strongly suggest that galax¬ 
ies are not related to matter in the way they envisioned 
[MB- Furthermore, observational studies have discov¬ 
ered that galaxy clustering depends on various physical 
properties: e.g. luminosity [I2UT^ . colour [m 
morphology OESIEI], and stellar mass [25] etc. 

Percival et al. |5S1 hereafter PVP] attempted to correct 
the FKP framework to take into account the effects of 
luminosity-dependent clustering. In a recent paper m 
hereafter SMI5], we argued that the approach of PVP, 
whilst appearing qualitatively reasonable, is in fact at 
odds with our current understanding of galaxy formation, 
and so non-optimal. More recent studies by Seljak et al. 
P5] . Hamaus et al. |53| and Gai et al. [30] suggested that 
weighting the galaxy density field by a linear function of 
halo mass would reduce stochasticity. 

In SMI5 we developed a new scheme incorporating a 
number of the key ideas from galaxy formation: galax¬ 


ies only form in dark matter haloes [B; haloes can host 
galaxies of various luminosities; the large-scale bias asso¬ 
ciated with a given galaxy is largely inherited from the 
bias of the host dark matter halo. 

In this work we demonstrate that our new optimal 
estimator indeed provides both improved signal-to-noise 
(hereafter S/Af) estimates of the galaxy power spectrum 
and boosted cosmological information content, when 
compared with the FKP and PVP approaches. 

This letter is broken down as follows: First, we pro¬ 
vide a brief overview of the results from SM15. Next, we 
evaluate the S/Af expressions for the various weighting 
schemes, followed by the cosmological information from 
a putative all-sky galaxy redshift survey. Finally, we con¬ 
clude. 


Optimal power spectrum estimation — In the origi¬ 
nal work of FKP, the starting concept is that galaxies are 
simply an independent point sampling of the underlying 
galaxy field. Hence, ^ 

”gW , (1) 

where Ng is the number of galaxies, and is the position 
of the ith galaxy in the survey. From this field one then 
may construct an effective galaxy over-density field: 


= 0(r)w(r) ng(r) - an^ir) 


( 2 ) 


where 0(r) is a survey mask function, which is 1 if the 
galaxy lies inside the survey volume and 0 otherwise, a is 
a scaling factor for the spatially random galaxy field ns(r) 
and w(r) is an optimal weight function that depends on 
r. If we now follow the FKP logic and compute the power 
spectrum of the field, one finds that it is related to 
the galaxy power spectrum Pg{k) through the relation: 


\:FrW = 


d^k' 

(27r)" 


Pg(k') g™^(k-k') 




shot ’ 

( 3 ) 
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where is the weighted version of the Fourier 

transform of the survey mask function 0(r), and 
is an effective shot noise correction. If one subtracts the 
shot noise and deconvolves for the survey window func¬ 
tion, then one may obtain an estimate of Pg(fc). It is im¬ 
portant to realise that the above procedure is only correct 
under the assumption that the galaxy power spectrum 
does not depend on any observable, e.g. galaxy lumi¬ 
nosity, colour, spectral type, host halo mass, etc. and 
that shot noise is as was given by FKP. If these assump¬ 
tions are wrong, then the functions Pg(fc), and 

-^shot '^P these dependencies, resulting in a 

biased and sub-optimal reconstruction of the ‘true’ power 
spectrum [31) . As noted earlier, current observational ev¬ 
idence indicates that clustering strength does depend on 
the sample selection. Hence, FKP must be biased and 
sub-optimal (PVP also came to a similar conclusions). 

We now summarise the SMI5 formalism, designed 
to account for a number of these effects. Consider a 
large survey volume containing Ng galaxies that are con¬ 
strained to be distributed inside dark matter haloes. 
Thus the *th dark matter halo of mass Mi and position 
of the centre of mass x^, will have Ng{Mi) galaxies. The 
jth galaxy will have a position vector Vj relative to the 
centre of the halo and a luminosity Lj. For this more 
complicated distribution, Eq. 0 can be generalized to: 

ng(r, P,x, M) = ^ 5°(x — Xi)S^{M — Mi) 

i=l 


where the four Dirac delta functions, going from right 
to left, are sampling: the luminosity of each galaxy in a 
given halo; the spatial location of a given galaxy relative 
to the halo centre; the halo mass from the distribution of 
masses; and the halo centre in the survey volume. 

In direct analogy with FKP’s Eq. Q, we define an ef¬ 
fective galaxy over-density held: 


Pg(r) = J dL J d^x J dMe{r\L) 


w{r, L, X, M) 

Va 


ng(r, L, X, M) — ans(r, L, x, M) 


( 5 ) 


where 0(r|P) is the luminosity-dependent survey geom¬ 
etry function; A is a normalisation constant; a is a scal¬ 


ing factor for the random halo catalogue; w is a general 
weight function. The function rig is the same as rig, ex¬ 
cept the spatial locations of the halo centres have been 
randomized and the number density has been scaled up 
by a factor of 1/a. As shown by SM15, in the large-scale 
limit, when the distribution of galaxies in haloes adopts a 
Dirac-delta-function-like behaviour, the J^g power spec¬ 
trum can be written: 


l■^g(k)|' 


d^q 

(27r)3 


P(q) ^(%(k-q) 


T Fshot 7 


( 6 ) 

where P(q) is the true matter power spectrum, which 
is convolved with the effective survey window function 
and Pghot is a new effective shot noise. The set 
of survey window functions that are required to evaluate 
these expressions can be written in general: 


dLQ{r\L)^(L\M)w^{r, L, x, M) 


( 7 ) 


where in the above n{M, x) and b{M, x) are the mass 
function and large-scale bias of haloes of mass M at ra¬ 
dial position X from the observer (x here is also act¬ 
ing as coordinate time); N^\m) gives the nth factorial 
moment of the halo occupation distribution (hereafter 
HOD); $(L|M) gives the conditional probability density 
that a galaxy hosted in a halo of mass M has a luminos¬ 
ity L. Using these functions the effective shot noise term 
can be written: 


Tshot = (1 + a) 


(2^)3 ^(1.0) 


(q) + G 


( 1 ) 

( 2 . 0 ) 



( 8 ) 


We also introduce the normalisation-free window func¬ 
tions which enables us to write: 


A= fd^r 


. We thus see that, similar to the 


EKP approach, in order to recover the matter power 
spectrum, one must subtract the effective shot-noise term 


and deconvolve for the square of the effective survey win¬ 
dow function 

In the large-scale limit and under the assumption that 
the matter density field is Gaussianly distributed, SM15 
also showed that the iS/AC can, for an arbitrary weight 
w, be written in general as: 


V(k) 


NJ 2(27r)3 


d^r 


^(bi)W 
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d^r 
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-k 


(1 + Q^) 
P[k) 




( 9 ) 


where U(A:) = 47rfc/Afc 1-k (Afc/fci)^/12 


is the volume of the ith fc-space shell in which P{k) is estimated. 
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Comparison of weighting schemes — The failure 
of the FKP scheme to characterise the true clustering 
strengths of galaxies means that it is a biased and sub- 
optimal estimator. We will now show explicitly, under 
the assumption that the SM15 description of the galaxy 
population is the correct one, that both the FKP and 
PVP weighting schemes do indeed lead to sub-optimal 
measurements of P{k). The weighting schemes are: 

• The FKP weights: These depend only on the posi¬ 
tion of the galaxy in the survey: 


Wfkp(i’) = l/[l + «g(r)P(fc)] , (10) 

where ng(r) is the mean number density of galaxies. 

• The PVP weights: These depend explicitly on the 
luminosity dependence of the galaxy bias and also the 
position in the survey: 


Wpyp{r,L) = b{L)/ 1-Png(r)6|(r)P(fc) 


( 11 ) 


where the luminosity-dependent galaxy bias is b{L) = 
J dMn{M)b{M)N^^\M)<S>{L\M)/^{L). 5f(r) = 

dL6^(L)$(L)/ng(r) is the average square of the 
luminosity bias. The galaxy luminosity function is given 
by $(L) = / dMn{M)N^^\M)<^>{L\M), and 
was introduced after Eq. 0 - 

• Optimal weights: In the large-scale limit, these 
weights depend only on the galaxy’s spatial position and 
its host halo mass, and not explicitly on its luminosity. 
The weights are: 


i(;opT(r, M) = b{M)/ [1 -K R{M)S{r, M)] [1 -P neff(r)P(fc)], 

( 12 ) 

where R{M) = N^\M)/n2'\m) is the ratio 

of the second and first factorial moments of the 
halo occupation distribution and we introduced 
the effective number density of galaxies: neff(r) = 

/ dMn{M)b^{M)N^^'> (M)5(r, M)/[l + R{M)S{r, M)]. 
We defined iS(r, M) = dL^(L\M) as the fraction 

of galaxies hosted by haloes of mass M that are observ¬ 
able at a spatial position r, with Pi„in(r) the minimum 
luminosity that a galaxy could have and still be observ¬ 
able given the survey flux-limit. Explicitly, Pmin(r) = 
[(iL(r)/l/i“^Mpc] ^ [T©], where 
TOiim is the apparent magnitude limit of the survey, 
M© is the absolute magnitude of the sun, h is the 
dimensionless Hubble parameter and dL(r) = {l + z)x{z) 
is the luminosity distance in flat cosmological models. 
Note that S{0,M) = 1 and 5(oo,M) = 0. For more 
details on the S/Af expressions for the three weights 
considered, we refer the interested reader to SM15. 

We now show the S/Af on the galaxy power spectrum 
corresponding to the FKP, PVP and SM15 methods for 
weighting the galaxy distribution. As a concrete exam¬ 
ple we consider a flux-limited, full-sky galaxy redshift 
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FIG. 1. S/M level for the optimal (blue lines, ratio> 1) and 
the PVP (red lines, ratio < 1) weighting schemes relative to 
the FKP one, as a function of survey bj flux-limit. The solid- 
dashed, dot-dashed and dotted line styles denote results for k £ 
{0.2, 0.3, 0.4, 0.5}/iMpc“^, respectively. The optimal scheme of 
SM15 clearly maximises the S/M ratio for the future survey cases 
considered. 


survey spanning the redshift range 2 = 0.3-0.9. In or¬ 
der to evaluate the above expressions we need to specify 
several model ingredients. For the evolution of h{M) 
and b{M) we use the models of Sheth and Tormen |32| . 
For the conditional probability distribution $(L|A4) and 
the first factorial moment of the HOD Ng^\M), we use 
the Conditional Luminosity Function (CLF) model of 
Yang et al. [33]. For the second factorial moment we 




(i)i 


g (M) 


where 


use the model: Ng^\M) = /3{M) 
from fitting to semi-analytic models of galaxy formation 
= 1/2logj^Q (M/I0^^/i“^/i“^M©) for the case 
that M < I0^^/i“^M© and unity otherwise [Mj. From 
these ingredients all required variables may be computed. 

Figurej^shows the S/Af for the SMI5 (blue lines) and 
PVP (red lines) schemes ratioed with the S/N for the 
FKP scheme, respectively. The results are presented as 
a function of limiting bj magnitude and for various k- 
mode bins. Clearly, the optimal scheme of SMI5 does 
indeed lead to the largest S/J\f: > 5% improvement over 
FKP at fc ^ 0.2 AMpc”^, and > 20% improvement at 
k ^ 0.5/iMpc“^ for surveys with depth bj > 22. In¬ 
terestingly, the scheme of PVP leads to the least opti¬ 
mal set of estimates, being ^ 20% lower than FKP at 
k = 0.2AMpc“^ and ~ 40% lower by fc = 0.5/iMpc“^, 
again for surveys with 6j > 22. 


Forecasting cosmological information — The ability 
of a set of power spectrum band-power estimates to con- 















4 



kmax [h/Mpc] 

FIG. 2. Forecasted ID marginalized errors and relative errors on cosmological parameters as a function of the maximum wavenumber 
considered in the power spectrum estimates from a full-sky galaxy clustering survey of depth bj ~ 22. The solid red, dashed blue and 
black dotted lines represent the SM15, FKP and PVP weighting schemes, respectively. The panels, going clockwise from the top-left show 
the results for the eight cosmological parameters considered. The largest potential information gains to be had from optimal weighting 
are in the measurements of {wq, Wa, .Ag}. Note that we have not properly taken into account the growth evolution of structure, and used 
power spectrum derivatives suitable for only a single redshift. 


strain the cosmological parameters 9^^ can be forecasted 
through construction of the Fisher information matrix 
[35j ■ For a continuum limit of Fourier modes the Fisher 
matrix can be expressed as [36) : 

^ _ f d^k d\ogP{k)dlogP{k) f 

~ J v{k) de^ ^ ’ ■ 

(13) 

Thus, in order to compute the Fisher matrix, one needs to 
specify the S/N, and the derivatives of the power spectra 
with respect to the cosmological parameters. The former 
were computed in the previous section, and we estimate 
the latter at a single redshift. Therefore our forecasts will 
be pessimistic, since we do not fully take into account 
the information in the growth of structure, but here we 
are only interested in the relative differences between the 
three weighting schemes. 

For our fiducial model we adopt a flat, dark-energy 
dominated cosmological model, characterised by eight 
parameters: 9^ G {u>o, Wi, OdEt ^s, cTs}- 

The first two characterise the equation of state for dark 
energy: w{a) = Pw/Pw = wq + {1 - a)wi\ IIde is 
the dark energy density parameter; and 

are the physical densities in CDM and baryons, re¬ 
spectively; and As, Us, and Og denote the amplitude, 
spectral index, and running of the primordial scalar 
power spectrum, respectively. We adopt the values 


= {-1,0,0.69,0.12,0.02,2.15 x 10-^ 0.96,0}, con¬ 
sistent with Planck data m- The power spectrum 
derivatives we compute through finite differencing matter 
power spectra from GAME [38] . 

Figure shows the forecasted ID marginalized er¬ 
rors on the parameters, as a function of the maximum 
wavenumber fcmax entering the integral of Eq. (131. The 
panels show the fractional error, or if the fiducial value is 
zero, the error. Clearly, the smallest errors are obtained 
when one implements the optimal weighting scheme of 
SM15 (red solid lines), followed by FKP (blue dashed 
line) and then PVP (black dotted lines). We notice that 
the constraints on (Ag, tug, wi) show the most significant 
improvements from the optimal weighting. 

Figure shows the forecasted 2D marginalized errors 
on various parameter combinations. The line styles are 
the same as in Figure Again, the optimal weighting 
of SM15 performs best and the parameters (As,wo,wi) 
appear to be the most affected by the new scheme. 


Conclusions — In this letter we presented an overview 
of the optimal power spectrum estimation scheme of 
SM15. We argued that the FKP scheme was biased and 
sub-optimal since it does not take into account variations 
of clustering with the galaxy sample. We argued that 
the SM15 framework, which encodes several key concepts 
from the theory of galaxy formation, is able to describe 
these variations. We evaluated the S/Af resulting from 
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FIG. 3. Forecasted 2D marginalized errors on cosmological parameters for the eight cosmological parameters considered. The maximum 
wavenumber was set to k = 0.5/iMpc“^ and the flux-limit was taken to be bj = 22. Note that we have not properly taken into account 
the growth evolution of structure, and have used only power spectrum derivatives suitable for a single redshift. Nevertheless, it can be 
clearly seen that the optimal weighting scheme provides the tightest constraints on parameters. 


the FKP, PVP, and SM15 weighting schemes for the case 
of an all-sky galaxy survey. The SM15 weighting scheme 
was found to indeed be the most efficient estimator. We 
then turned to the issue of cosmological information and 
using the Fisher matrix approach showed that the SM15 
scheme also produced the smallest errors on cosmologi¬ 
cal parameters. In particular, the parameters governing 
the amplitude of the primordial power spectrum and the 
evolution of the dark energy equation of state were no¬ 
ticeably improved. 

In the previous studies of [2H1HH] h was claimed that 
weighting galaxy groups by some linear function of the 
halo mass would lead to reduced shot-noise and hence 
boosted signal-to-noise estimates. In the limit of large 
number of galaxies per halo, the mass dependence of the 
SM15 weights is w oc b{M)/Ng^\M), whereas in the 


limit of small numbers w oc b{M). Clearly, the SM15 
weighting scheme does not follow this mass scaling. The 
calculation of SM15 has maximised the S/Af on power 
spectrum estimates, albeit under certain assumptions, 
whereas the calculations of Seljak et al. [55] and later 
Hamaus et al. |55| have minimised the stochasticity on 
the halo density field. These two things are not obviously 
the same. We argue that our approach is the correct path 
to follow since, by design, it minimises directly errors in 
the power spectrum and it has clearly built into its frame¬ 
work the corner stones of galaxy formation theory. 
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